Using Topic Shifts in XML Retrieval at INEX 2006
Abstract
This paper describes the retrieval approaches used by Queen Mary, University of London in the INEX 2006 ad hoc track. In our participation, we mainly investigate an element-specific smoothing method within the language modelling framework. We adjust the amount of smoothing applied to each XML element according to its number of topic shifts, in order to provide focused access to XML elements in the Wikipedia collection. We also investigate whether using non-uniform priors is beneficial for the ad hoc tasks.
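The abstract does not give the smoothing formula, so the following is only a minimal sketch of what element-specific smoothing in a query-likelihood language model could look like. It assumes Jelinek-Mercer style interpolation between an element model and a collection model, and it assumes a hypothetical mapping `element_weight` from the number of topic shifts to the interpolation weight. The function names, the parameter `mu`, and the direction of the mapping are illustrative assumptions, not the authors' method.

```python
import math
from collections import Counter


def element_weight(topic_shifts: int, mu: float = 5.0) -> float:
    """Hypothetical mapping from topic shifts to the element-model weight.

    Assumption for illustration only: the weight grows with the number
    of topic shifts and is bounded in [0, 1). The paper's actual mapping
    is not stated in the abstract.
    """
    return topic_shifts / (topic_shifts + mu)


def score(query_terms, element_terms, collection_counts, collection_size,
          topic_shifts, mu=5.0):
    """Log query likelihood of an element under interpolated smoothing.

    P(t | e) = lam * P_ml(t | e) + (1 - lam) * P_ml(t | C),
    where lam depends on the element's number of topic shifts.
    """
    lam = element_weight(topic_shifts, mu)
    tf = Counter(element_terms)
    n = len(element_terms)
    log_p = 0.0
    for t in query_terms:
        p_elem = tf[t] / n if n else 0.0
        p_coll = collection_counts.get(t, 0) / collection_size
        p = lam * p_elem + (1.0 - lam) * p_coll
        if p == 0.0:
            # Term unseen in both the element and the collection.
            return float("-inf")
        log_p += math.log(p)
    return log_p


if __name__ == "__main__":
    # Toy data, invented purely for illustration.
    counts = {"xml": 10, "retrieval": 10}
    print(score(["xml"], ["xml", "retrieval"], counts, 100, topic_shifts=5))
```

The point of the sketch is only that the smoothing weight is computed per element rather than fixed globally; any monotone function of the topic-shift count could be substituted for `element_weight`.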
Similar resources
Using Topic Shifts for Focussed Access to XML Repositories
In focussed XML retrieval, a retrieval unit is an XML element that not only contains information relevant to a user query, but also is specific to the query. INEX defines a relevant element to be at the right level of granularity if it is exhaustive and specific to the user’s request – i.e., it discusses fully the topic requested in the user’s query and no other topics. The exhaustivity and spe...
Understanding Differences between Search Requests in XML Element Retrieval
XML retrieval, a very active branch of IR, studies the focused retrieval of semi-structured data. Although much progress has been made, especially through the annual INitiative for the Evaluation of XML retrieval (INEX), very little is known about XML element retrieval in action: What do users expect from an element retrieval system? What kind of information needs do they have? What sort of res...
What does Shakespeare have to do with INEX? User Queries, Assessment Behaviour and Best Entry Point Selection Strategies in XML Retrieval
Since 2002, the INitiative for the Evaluation of XML Retrieval (INEX) has been building an XML test collection for the evaluation of content-oriented XML search systems. In 2006, INEX extended its range of investigated user tasks to include the Best in Context task, where systems are required to return Best Entry Points (BEPs) to the user. In this paper we take a look back at a small user study...
Overview of INEX 2006
Since 2002, INEX has been working towards the goal of establishing an infrastructure, in the form of a large XML test collection and appropriate scoring methods, for the evaluation of content-oriented XML retrieval systems. This paper provides an overview of the work carried out as part of INEX 2006.
Using Language Models and the HITS Algorithm for XML Retrieval
Our submission to the INEX 2006 Ad-hoc retrieval track is described. We study how to utilize the Wikipedia structure (XML documents with hyperlinks) by combining XML and Web retrieval. In particular, we experiment with different combinations of language models and the HITS algorithm. An important feature of our techniques is a filtering phase that identifies the relevant part of the corpus, pri...
Publication date: 2006